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ABSTRACT 

Although  bio-inspired  legged  robots  have  advantageous  mobility,  they  can  be  very  inefficient.  Their  intrinsic  walking  mobility  is 
sometimes  outweighed  by  the  inefficiency  of  their  drive-train.  Some  of  these  inefficiencies  are  due  to  collision  losses,  but  they  are 
also  due  to  suboptimal  powering  schemes.  This  paper  addresses  the  powering  schemes  and  seeks  to  clearly  delineate  an  optimal 
solution  to  powering  the  walking  motion  of  a  two-legged  or  biped  walker.  We  examine  a  simplified  model  of  locomotion  called  the 
“rocket  car”  to  extract  the  meaningful  parameters  that  affect  time  and  energy  cost.  Using  Pontryagin’s  Maximum  Principle,  we 
dissect  the  cost  function,  the  state  equation,  co-state  equation,  and  control  input  constraints  to  describe  the  optimal  control.  The 
result  of  the  paper  shows  a  “bang-off"  control,  and  we  describe  the  “coasting  line”  between  these  extremes.  It  is  not  possible  to 
find  a  complete  closed-form  solution  for  the  problem,  and  numerical  methods,  such  as  dynamic  programming  must  be  used  for 
future  simulation  and  visualization  of  the  results. 


INTRODUCTION 

Legged  Robotics  has  long  held  the  promise  of  mobility  [1]. 
Until  recently,  most  research  in  legged  robotics  has  focused 
on  stability.  Recent  research  [2]  has  come  to  the  conclusion 
that  stability  is  a  necessary,  but  not  sufficient  condition. 
Efficiency  is  a  key  factor  that  must  not  be  overlooked. 
Particularly,  we  examine  efficiency  in  the  powering  scheme 
of  a  legged  robot. 

As  a  rough  order  of  magnitude  (ROM)  estimate,  we 
approximate  the  powering  scheme  for  a  legged  system  with 
the  so-called  “rocket  car”  problem  [3].  This  deliberately 
simplifies  the  problem  by  leaving  out  potential  energy 
changes  in  the  system.  We  will  adjust  the  problem  to  reach 
the  origin  with  maximum  velocity,  instead  of  zero  velocity, 
to  simulate  the  transport  of  mass  as  quickly  as  possible 
through  a  single  stride. 

Poyntryagin’ s  Maximum  Principle  (PMP)  will  be  used  to 
study  the  control  scheme,  which  is  the  foundation  of  modem 
control  theory  and  has  its  roots  in  the  solutions  of  problems 
required  for  space  technology  (hence  the  name  “rocket  car”). 
This  is  an  arcane  mathematical  principle,  which  is  best 
understood  from  an  economics  point  of  view.  The  interested 
reader  is  directed  to  a  few  lucid  treatments  in  [4-5]. 

POSITIONING  PROBLEM  WITH  A  TIME  AND 
KINETIC  ENERGY  COST  FUNCTION  (j) 

In  the  textbook  treatment  of  the  rocket  car  problem 
problem,  the  cost  function,  (f) ,  is  assumed  to  be  for  a  time 
optimal  problem  (TOP),  fuel,  energy,  or  a  mixed  case  of 
these.  The  standard  treatment  for  the  energy  cost  function  is 


(j)  =  11^ ,  which  turns  out  to  be  the  case  if  M  is  a  current 
source.  In  this  paper,  we  derive  a  treatment  for  energy 
where  M  is  a  control  force  instead.  We  show  that  the  cost 
function  for  energy  is  (j)  =  UV  . 

Given: 


X  m 

- NO - ►  ^ 


Figure  1:  Positioning  Problem 

Required: 

Drive  the  mass  back  to  the  origin  at  .r  =  0  with  maximum 
velocity,  while  minimizing  some  integral-type  cost,  J  , 
where: 

T 

J  =  j^(x,w)Jr  (1) 

0 

In  this  problem,  we  choose  two  components  for  the  cost 
(time  and  energy) 

+  (2) 

Substituting  for  power 

J  =  \dt+\—dt  (3) 

•'  ^  dt 


Optimal  Time  and  Energy  Efficiency  in  Legged  Robotics,  Muench,  et  al. 


Page  1  of  4 


Report  Documentation  Page 

Form  Approved 

0MB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  0MB  control  number. 

1.  REPORT  DATE 

17  AUG  2009 

2.  REPORT  TYPE 

N/A 

3.  DATES  COVERED 

4.  TITLE  AND  SUBTITLE 

Optimal  Time  and  Energy  Efficiency  in  Legged  Robotics 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

Paul  Muench;  Gregory  Czerniak;  Ka  C  Cheok  PhD 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

US  Army  RDECOM-TARDEC  6501  Ell  Mile  Rd  Warren,  MI 
48397-5000 

8.  PERFORMING  ORGANIZATION  REPORT 
NUMBER 

20072 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR’S  ACRONYM(S) 

TACOM/TARDEC 

11.  SPONSOR/MONITOR’S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release,  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

Presented  at  NDIAs  Ground  Vehicle  Systems  Engineering  and  Technology  Symposium  (GVSETS),  17  22 
August  2009, Troy,  Michigan,  USA,  The  original  document  contains  color  images. 

14.  ABSTRACT 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION 

OF  ABSTRACT 

SAR 

18.  NUMBER 
OF  PAGES 

4 

19a.  NAME  OF 

RESPONSIBLE  PERSON 

a.  REPORT 

unclassified 

b.  ABSTRACT 

unclassified 

c.  THIS  PAGE 

unclassified 

Standard  Form  298  (Rev.  8-98} 

Prescribed  by  ANSI  Std  Z39-18 


Proceedings  of  the  2009  Ground  Vehicle  Systems  Engineering  and  Technology  Symposium  (GVSETS) 


Substituting  for  kinetic  energy 


Noting  that  mv  =  ma  =  u 

I  mvvdt  =  I  uvdt 

Einally,  substitute  the  above  result  into  (1)  to  yield 

T 

J=\(k  +  uv)dt 

0 


(4) 

(5) 

(6) 


Where  k  is  a  positive  constant.  Thus  we  have  two 
components  for  the  cost,  with  a  constant  k  to  measure  the 
relative  importance  between  the  two. 

POYNTRYAGIN’S  MAXIMUM  PRINCIPLE  (PMP) 

7‘”  step  in  the  Poyntryagin  Maximum  Principle:  Treat  cost 
as  an  additional  state.  The  state  equations  are  thus  given  by: 


k+uv'^ 

i^k  +  UX2  ^ 

X  = 

= 

V 

1 

= 

^2 

1^2; 

1  M  ; 

1  “  y 

Since  we  want  to  approach  the  origin  with  maximum 
velocity,  U  is  bounded  by  zero  [6]. 

-1<M<0  (8) 


In  other  words,  while  you  may  want  to  back  off  the  gas, 
you  would  never  want  to  apply  the  brakes. 

2"“'  step  in  the  Poyntryagin  Maximum  Principle:  We 
introduce  the  co-state  variables  to  form  the  Hamiltonian: 

H  =  Zpi:,)  -I-  Zjij  -I- 12^2  (9) 


Substituting  (7)  we  see  that 

H  =  ZQ{k  +  UX2)  + Z1X2  + Z2U  (10) 


The  co-state 
equations: 


equations  are  prescribed  by  Hamilton’s 


^0 


dH 

dx^ 

dH 

dx, 


=  0 


=  0 


(11) 

(12) 


dH 

Z2=-—  =  -Z,-ZqU  (13) 

dx2 

5'^“'  step  in  the  Poyntryagin  Maximum  Principle 

Solve  the  above  co-state  equations.  The  equation  for 
Zq  shows  that  =  const ,  and  the  PMP  requires  that  this 
constant  should  be  negative.  Without  loss  of  generality  we 
can  choose  Zg  =~1.  The  two  solutions  to  (11)  and  (12) 
are  thus: 

Zo=-1  Zi=A  (14) 

The  solution  to  the  last  equation  (13)  is  given  by 
substituting  (14),  which  yields  Z2  =U—  A,  and  the  solution 
becomes 

Z2=^udt-At  +  B  (15) 

We’ve  previously  shown  J  udt  =  mv  —  mVg  ,  so  that 

Z2=mx2+B-mvQ-At  (16) 

4"’  step  in  the  Poyntryagin  Maximum  Principle:  Eind  the 
supremum  of  H  as  a  function  of  U  .  The  Hamiltonian  is 
given  by  substituting  (14),  (16),  and  (7)  into  (10): 

H  =  -k-ux2  +Ax2  +u(mx2  +  B  -mv^  -  At)  (17) 


To  maximize  H  as  a  function  of  M  ,  we  must  maximize 
the  term  uq  where 

q  =  {^m-\)x2  +  B-mvQ-  At  (18) 


If  m  =  1  then  q'  =  B'—  At  where  B'  =  B  —  mVg  and 
since  q’  is  a  linear  function  of  t  there  is  at  most  one  zero 
crossing  for  q ' .  Thus 


-1  if 
0  if 


q'<0 

q'>0 


(19) 


PMP  has  thus  shown  that  we  have  “bang-off’  control  with 
a  single  switch.  This  makes  intuitive  sense,  since  we  would 
want  to  apply  the  force  early  on  in  order  to  gain  the  time 
savings  accrued  throughout  the  stride,  but  we  would  want  to 
shut  the  force  off  after  some  time  period  in  order  to  save 
energy. 
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OPTIMAL  COST 

We  desire  to  find  the  optimal  cost  7* .  We  assume  the 
optimal  solution  is  “bang-off’  as  described  in  the  previous 
section.  Eirst,  we  proceed  with  a  bang,  i.e.,  from  the  initial 
position  X  to  the  switching  position  S  we  apply  full  reverse 
force  iu=—\),  and  then  at  5 ,  we  cut  the  engines  off 
( M  =  0 )  and  coast  through  the  origin  with  a  cruising  speed 
of  “a.”  as  shown  in  the  phase  plot  in  Eigure  1 . 


Figure  2:  The  Switching  Curve 


We  can  rewrite  the  cost  integral  (6)  as  a  function  oix 
instead  of  t : 


J  = 


(20) 


dx 

where  we  have  used  the  definition  of  velocity  V  =  —  and 

dt 

the  definition  of  work  E  =  ^  F  ■  ds  =1  udx  . 

Working  backwards  from  the  initial  position  X  at  zero 
velocity  until  the  switch  at  position  S  and  velocity  a ,  the 
time  cost  becomes: 


r  ^  dx  ,  r  av  ,  r  av  , 

\k  —  =  km\  —  =  km\  —  =  —kma  (21) 
i  V  i  u 


f  dv 


dv 


which  uses  the  physics  result  ads  =  vdv  recast  as 
udx  =  mvdv . 

The  energy  cost  for  this  period  is  given  by: 

s  a  2 

I  udx  =  mj  vdv  =  (22) 


There  is  only  time  cost  as  we  cruise  at  velocity  a  from 
position  s  to  the  origin: 


dx 

a 


ks 

a 


(23) 


Putting  together  the  above  three  equations  we  get  the 
optimal  cost: 

2 

J*  =-kma  +  m— — k—  (24) 

2  a 


This  will  always  be  positive  since  a  is  negative. 

We  really  would  like  a  result  that  is  only  in  terms  of  the 
initial  conditions.  So  we  seek  a  formula  that  relates  5  to  a. 
Integrating  udx  =  mvdv : 


(25) 


We  find  the  relation: 


^  =  X- 


ma 


(26) 


Substituting  the  above  result  for  S  into  (24): 


,  a  k 

J  =  -kma  +  m - 

2  a 


f 


X- 


ma 


,2  A 


(27) 


Simplifying: 


,  a  k  kma 

J  =  -kma  +  m - X  -I - 

2  a  2 

kma  a^  kX 

= - l-m - 

2  2  a 


(28) 


We  desire  a  result  that  minimizes  J  (a)  : 

dJ*  ^  km  kX 

- =  0  = - 1-  ma  -i — — 

da  2  a 


(29) 


We  can  check  this  result  using  implicit  differentiation  of 
(24) 


dr 

da 


=  0  =  —km  +  ma  —  k 


^  s' 


\ 


\a  a  J 


(30) 


Which  we  substitute  into  (30)  along  with  S  ’  =  —ma  : 

^  X  m 


0  =  -km  +  ma  -  k 


-m- 


(31) 
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Simplifying; 


0  =  —kp{  +  ma  +  kp{  +  k 


^  km  kX 

0  = - 1-  ma  H 


X  m 


\a 


(32) 


2  a~ 

which  is  the  same  as  (29) 

This  result  yields  the  following  cubic  equation  for  a  : 


ma 


3  km 


a^+kX  =0 


(33) 


This  equation  must  be  solved  numerically  to  describe  the 
“coasting  line”  shown  in  Figure  2,  in  terms  of  the  initial 
conditions  of  the  problem. 

CONCLUSIONS  AND  FUTURE  WORK 

Describing  the  “coasting  line”  only  gives  an  initial  insight 
into  the  problem.  This  must  be  followed  by  numerical 
analysis  to  check  these  results,  and  verify  the  theory.  Other 
models  including  friction,  position  uncertainty,  and 
disturbance  would  add  to  the  richness  of  the  problem  and 


increase  its  difficulty.  The  authors  wish  to  thank  Grant 
Gerhart  and  Rob  Karlsen  for  funding  this  research  through 
the  In-Lab  Innovative  Research  (ILIR)  program  at  US  Army 
RDECOM/TARDEC. 
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